Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 4277 |
| Missing cells | 1417 |
| Missing cells (%) | 2.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 534.8 KiB |
| Average record size in memory | 128.0 B |
Variable types
| Text | 3 |
|---|---|
| Categorical | 4 |
| Boolean | 2 |
| Numeric | 7 |
VIP is highly imbalanced (87.2%) | Imbalance |
HomePlanet has 87 (2.0%) missing values | Missing |
CryoSleep has 93 (2.2%) missing values | Missing |
Cabin has 100 (2.3%) missing values | Missing |
Destination has 92 (2.2%) missing values | Missing |
Age has 91 (2.1%) missing values | Missing |
VIP has 93 (2.2%) missing values | Missing |
RoomService has 82 (1.9%) missing values | Missing |
FoodCourt has 106 (2.5%) missing values | Missing |
ShoppingMall has 98 (2.3%) missing values | Missing |
Spa has 101 (2.4%) missing values | Missing |
VRDeck has 80 (1.9%) missing values | Missing |
Name has 94 (2.2%) missing values | Missing |
Cabin_deck has 100 (2.3%) missing values | Missing |
Cabin_num has 100 (2.3%) missing values | Missing |
Cabin_side has 100 (2.3%) missing values | Missing |
PassengerId has unique values | Unique |
Age has 82 (1.9%) zeros | Zeros |
RoomService has 2726 (63.7%) zeros | Zeros |
FoodCourt has 2690 (62.9%) zeros | Zeros |
ShoppingMall has 2744 (64.2%) zeros | Zeros |
Spa has 2611 (61.0%) zeros | Zeros |
VRDeck has 2757 (64.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-22 16:13:07.677682 |
|---|---|
| Analysis finished | 2024-04-22 16:14:52.972174 |
| Duration | 1 minute and 45.29 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
PassengerId
Text
UNIQUE 
| Distinct | 4277 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 29939 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4277 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0013_01 |
|---|---|
| 2nd row | 0018_01 |
| 3rd row | 0019_01 |
| 4th row | 0021_01 |
| 5th row | 0023_01 |
| Value | Count | Frequency (%) |
| 0013_01 | 1 | < 0.1% |
| 0046_02 | 1 | < 0.1% |
| 0075_01 | 1 | < 0.1% |
| 0019_01 | 1 | < 0.1% |
| 0021_01 | 1 | < 0.1% |
| 0023_01 | 1 | < 0.1% |
| 0027_01 | 1 | < 0.1% |
| 0029_01 | 1 | < 0.1% |
| 0032_01 | 1 | < 0.1% |
| 0032_02 | 1 | < 0.1% |
| Other values (4267) | 4267 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
HomePlanet
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 87 |
| Missing (%) | 2.0% |
| Memory size | 33.5 KiB |
| Earth | |
|---|---|
| Europa | |
| Mars |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0183771 |
| Min length | 4 |
Characters and Unicode
| Total characters | 21027 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Earth |
|---|---|
| 2nd row | Earth |
| 3rd row | Europa |
| 4th row | Europa |
| 5th row | Earth |
Common Values
| Value | Count | Frequency (%) |
| Earth | 2263 | |
| Europa | 1002 | |
| Mars | 925 | |
| (Missing) | 87 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| earth | 2263 | |
| europa | 1002 | |
| mars | 925 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
CryoSleep
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93 |
| Missing (%) | 2.2% |
| Memory size | 33.5 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 93 |
| Value | Count | Frequency (%) |
| False | 2640 | |
| True | 1544 | |
| (Missing) | 93 | 2.2% |
Cabin
Text
MISSING 
| Distinct | 3265 |
|---|---|
| Distinct (%) | 78.2% |
| Missing | 100 |
| Missing (%) | 2.3% |
| Memory size | 33.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0813981 |
| Min length | 5 |
Characters and Unicode
| Total characters | 29579 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2714 ? |
|---|---|
| Unique (%) | 65.0% |
Sample
| 1st row | G/3/S |
|---|---|
| 2nd row | F/4/S |
| 3rd row | C/0/S |
| 4th row | C/1/S |
| 5th row | F/5/S |
| Value | Count | Frequency (%) |
| g/160/p | 8 | 0.2% |
| g/748/s | 7 | 0.2% |
| b/31/p | 7 | 0.2% |
| e/228/s | 7 | 0.2% |
| d/273/s | 7 | 0.2% |
| c/31/s | 6 | 0.1% |
| b/242/p | 6 | 0.1% |
| c/295/p | 6 | 0.1% |
| g/597/p | 6 | 0.1% |
| g/737/s | 6 | 0.1% |
| Other values (3255) | 4111 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Destination
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 92 |
| Missing (%) | 2.2% |
| Memory size | 33.5 KiB |
| TRAPPIST-1e | |
|---|---|
| 55 Cancri e | |
| PSO J318.5-22 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.185424 |
| Min length | 11 |
Characters and Unicode
| Total characters | 46811 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAPPIST-1e |
|---|---|
| 2nd row | TRAPPIST-1e |
| 3rd row | 55 Cancri e |
| 4th row | TRAPPIST-1e |
| 5th row | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 2956 | |
| 55 Cancri e | 841 | 19.7% |
| PSO J318.5-22 | 388 | 9.1% |
| (Missing) | 92 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trappist-1e | 2956 | |
| 55 | 841 | 13.4% |
| cancri | 841 | 13.4% |
| e | 841 | 13.4% |
| pso | 388 | 6.2% |
| j318.5-22 | 388 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| S | 3344 | 7.1% |
| - | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| A | 2956 | 6.3% |
| I | 2956 | 6.3% |
| R | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| S | 3344 | 7.1% |
| - | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| A | 2956 | 6.3% |
| I | 2956 | 6.3% |
| R | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| S | 3344 | 7.1% |
| - | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| A | 2956 | 6.3% |
| I | 2956 | 6.3% |
| R | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| S | 3344 | 7.1% |
| - | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| A | 2956 | 6.3% |
| I | 2956 | 6.3% |
| R | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Age
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 79 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 91 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.658146 |
| Minimum | 0 |
|---|---|
| Maximum | 79 |
| Zeros | 82 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 19 |
| median | 26 |
| Q3 | 37 |
| 95-th percentile | 55 |
| Maximum | 79 |
| Range | 79 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.179072 |
|---|---|
| Coefficient of variation (CV) | 0.49476583 |
| Kurtosis | 0.21852293 |
| Mean | 28.658146 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.48480029 |
| Sum | 119963 |
| Variance | 201.04607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 176 | 4.1% |
| 22 | 163 | 3.8% |
| 19 | 162 | 3.8% |
| 20 | 160 | 3.7% |
| 24 | 158 | 3.7% |
| 21 | 157 | 3.7% |
| 25 | 156 | 3.6% |
| 23 | 144 | 3.4% |
| 26 | 132 | 3.1% |
| 27 | 127 | 3.0% |
| Other values (69) | 2651 |
| Value | Count | Frequency (%) |
| 0 | 82 | |
| 1 | 27 | 0.6% |
| 2 | 35 | |
| 3 | 34 | |
| 4 | 20 | 0.5% |
| 5 | 20 | 0.5% |
| 6 | 25 | 0.6% |
| 7 | 13 | 0.3% |
| 8 | 24 | 0.6% |
| 9 | 21 | 0.5% |
| Value | Count | Frequency (%) |
| 79 | 2 | < 0.1% |
| 78 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 74 | 2 | < 0.1% |
| 73 | 5 | |
| 72 | 3 | |
| 71 | 2 | < 0.1% |
| 70 | 2 | < 0.1% |
| 69 | 6 |
VIP
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93 |
| Missing (%) | 2.2% |
| Memory size | 33.5 KiB |
| False | |
|---|---|
| True | 74 |
| (Missing) | 93 |
| Value | Count | Frequency (%) |
| False | 4110 | |
| True | 74 | 1.7% |
| (Missing) | 93 | 2.2% |
RoomService
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 842 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 82 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 219.26627 |
| Minimum | 0 |
|---|---|
| Maximum | 11567 |
| Zeros | 2726 |
| Zeros (%) | 63.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 53 |
| 95-th percentile | 1274.5 |
| Maximum | 11567 |
| Range | 11567 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 607.01129 |
|---|---|
| Coefficient of variation (CV) | 2.7683751 |
| Kurtosis | 53.216268 |
| Mean | 219.26627 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.5583897 |
| Sum | 919822 |
| Variance | 368462.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2726 | |
| 1 | 68 | 1.6% |
| 2 | 34 | 0.8% |
| 3 | 28 | 0.7% |
| 4 | 24 | 0.6% |
| 6 | 16 | 0.4% |
| 5 | 15 | 0.4% |
| 9 | 13 | 0.3% |
| 8 | 12 | 0.3% |
| 13 | 11 | 0.3% |
| Other values (832) | 1248 | |
| (Missing) | 82 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 2726 | |
| 1 | 68 | 1.6% |
| 2 | 34 | 0.8% |
| 3 | 28 | 0.7% |
| 4 | 24 | 0.6% |
| 5 | 15 | 0.4% |
| 6 | 16 | 0.4% |
| 7 | 8 | 0.2% |
| 8 | 12 | 0.3% |
| 9 | 13 | 0.3% |
| Value | Count | Frequency (%) |
| 11567 | 1 | |
| 7407 | 1 | |
| 6438 | 1 | |
| 5900 | 1 | |
| 5862 | 1 | |
| 5454 | 1 | |
| 5333 | 1 | |
| 5100 | 1 | |
| 4922 | 1 | |
| 4908 | 1 |
FoodCourt
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 902 |
|---|---|
| Distinct (%) | 21.6% |
| Missing | 106 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 439.4843 |
| Minimum | 0 |
|---|---|
| Maximum | 25273 |
| Zeros | 2690 |
| Zeros (%) | 62.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 78 |
| 95-th percentile | 2518.5 |
| Maximum | 25273 |
| Range | 25273 |
| Interquartile range (IQR) | 78 |
Descriptive statistics
| Standard deviation | 1527.663 |
|---|---|
| Coefficient of variation (CV) | 3.4760356 |
| Kurtosis | 67.764434 |
| Mean | 439.4843 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.9106254 |
| Sum | 1833089 |
| Variance | 2333754.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2690 | |
| 1 | 59 | 1.4% |
| 2 | 30 | 0.7% |
| 4 | 22 | 0.5% |
| 3 | 21 | 0.5% |
| 6 | 20 | 0.5% |
| 5 | 19 | 0.4% |
| 7 | 13 | 0.3% |
| 11 | 12 | 0.3% |
| 10 | 12 | 0.3% |
| Other values (892) | 1273 | |
| (Missing) | 106 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 2690 | |
| 1 | 59 | 1.4% |
| 2 | 30 | 0.7% |
| 3 | 21 | 0.5% |
| 4 | 22 | 0.5% |
| 5 | 19 | 0.4% |
| 6 | 20 | 0.5% |
| 7 | 13 | 0.3% |
| 8 | 11 | 0.3% |
| 9 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 25273 | 1 | |
| 23397 | 1 | |
| 20809 | 1 | |
| 20229 | 1 | |
| 16963 | 1 | |
| 16954 | 1 | |
| 16250 | 1 | |
| 16071 | 1 | |
| 12350 | 1 | |
| 11984 | 1 |
ShoppingMall
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 715 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 98 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 177.29553 |
| Minimum | 0 |
|---|---|
| Maximum | 8292 |
| Zeros | 2744 |
| Zeros (%) | 64.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 994.1 |
| Maximum | 8292 |
| Range | 8292 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 560.82112 |
|---|---|
| Coefficient of variation (CV) | 3.1631995 |
| Kurtosis | 68.221142 |
| Mean | 177.29553 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.8249391 |
| Sum | 740918 |
| Variance | 314520.33 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2744 | |
| 1 | 72 | 1.7% |
| 3 | 35 | 0.8% |
| 2 | 32 | 0.7% |
| 4 | 24 | 0.6% |
| 7 | 19 | 0.4% |
| 9 | 17 | 0.4% |
| 8 | 16 | 0.4% |
| 12 | 13 | 0.3% |
| 10 | 12 | 0.3% |
| Other values (705) | 1195 | |
| (Missing) | 98 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 2744 | |
| 1 | 72 | 1.7% |
| 2 | 32 | 0.7% |
| 3 | 35 | 0.8% |
| 4 | 24 | 0.6% |
| 5 | 11 | 0.3% |
| 6 | 12 | 0.3% |
| 7 | 19 | 0.4% |
| 8 | 16 | 0.4% |
| 9 | 17 | 0.4% |
| Value | Count | Frequency (%) |
| 8292 | 1 | |
| 8251 | 1 | |
| 8098 | 1 | |
| 8017 | 1 | |
| 7022 | 1 | |
| 6252 | 1 | |
| 6108 | 1 | |
| 6061 | 1 | |
| 6023 | 1 | |
| 5649 | 1 |
Spa
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 833 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 101 |
| Missing (%) | 2.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 303.05244 |
| Minimum | 0 |
|---|---|
| Maximum | 19844 |
| Zeros | 2611 |
| Zeros (%) | 61.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 50 |
| 95-th percentile | 1525 |
| Maximum | 19844 |
| Range | 19844 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 1117.186 |
|---|---|
| Coefficient of variation (CV) | 3.6864445 |
| Kurtosis | 80.460402 |
| Mean | 303.05244 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6902979 |
| Sum | 1265547 |
| Variance | 1248104.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2611 | |
| 1 | 72 | 1.7% |
| 2 | 43 | 1.0% |
| 3 | 29 | 0.7% |
| 4 | 27 | 0.6% |
| 6 | 23 | 0.5% |
| 8 | 22 | 0.5% |
| 7 | 19 | 0.4% |
| 5 | 16 | 0.4% |
| 9 | 16 | 0.4% |
| Other values (823) | 1298 | |
| (Missing) | 101 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 2611 | |
| 1 | 72 | 1.7% |
| 2 | 43 | 1.0% |
| 3 | 29 | 0.7% |
| 4 | 27 | 0.6% |
| 5 | 16 | 0.4% |
| 6 | 23 | 0.5% |
| 7 | 19 | 0.4% |
| 8 | 22 | 0.5% |
| 9 | 16 | 0.4% |
| Value | Count | Frequency (%) |
| 19844 | 1 | |
| 15733 | 1 | |
| 15255 | 1 | |
| 14252 | 1 | |
| 13983 | 1 | |
| 12842 | 1 | |
| 12767 | 1 | |
| 12690 | 1 | |
| 12437 | 1 | |
| 11483 | 1 |
VRDeck
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 796 |
|---|---|
| Distinct (%) | 19.0% |
| Missing | 80 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 310.71003 |
| Minimum | 0 |
|---|---|
| Maximum | 22272 |
| Zeros | 2757 |
| Zeros (%) | 64.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 36 |
| 95-th percentile | 1536.8 |
| Maximum | 22272 |
| Range | 22272 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 1246.9947 |
|---|---|
| Coefficient of variation (CV) | 4.0133714 |
| Kurtosis | 93.842398 |
| Mean | 310.71003 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.38721 |
| Sum | 1304050 |
| Variance | 1554995.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2757 | |
| 1 | 72 | 1.7% |
| 2 | 38 | 0.9% |
| 3 | 33 | 0.8% |
| 7 | 23 | 0.5% |
| 6 | 21 | 0.5% |
| 4 | 20 | 0.5% |
| 5 | 17 | 0.4% |
| 8 | 10 | 0.2% |
| 19 | 10 | 0.2% |
| Other values (786) | 1196 | |
| (Missing) | 80 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 2757 | |
| 1 | 72 | 1.7% |
| 2 | 38 | 0.9% |
| 3 | 33 | 0.8% |
| 4 | 20 | 0.5% |
| 5 | 17 | 0.4% |
| 6 | 21 | 0.5% |
| 7 | 23 | 0.5% |
| 8 | 10 | 0.2% |
| 9 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 22272 | 1 | |
| 19086 | 1 | |
| 18670 | 1 | |
| 16514 | 1 | |
| 15940 | 1 | |
| 15125 | 1 | |
| 14834 | 1 | |
| 14587 | 1 | |
| 14268 | 1 | |
| 12863 | 1 |
Name
Text
MISSING 
| Distinct | 4176 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 94 |
| Missing (%) | 2.2% |
| Memory size | 33.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 13.756634 |
| Min length | 7 |
Characters and Unicode
| Total characters | 57544 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4169 ? |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | Nelly Carsoning |
|---|---|
| 2nd row | Lerome Peckers |
| 3rd row | Sabih Unhearfus |
| 4th row | Meratz Caltilter |
| 5th row | Brence Harperez |
| Value | Count | Frequency (%) |
| extraly | 14 | 0.2% |
| hopperett | 13 | 0.2% |
| tranklinay | 11 | 0.1% |
| apple | 10 | 0.1% |
| garrez | 10 | 0.1% |
| dickley | 10 | 0.1% |
| petton | 9 | 0.1% |
| logannon | 9 | 0.1% |
| brie | 9 | 0.1% |
| emenez | 9 | 0.1% |
| Other values (3821) | 8262 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Cabin_deck
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 100 |
| Missing (%) | 2.3% |
| Memory size | 33.5 KiB |
| F | |
|---|---|
| G | |
| E | |
| B | |
| C | |
| Other values (3) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4177 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | G |
|---|---|
| 2nd row | F |
| 3rd row | C |
| 4th row | C |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 1445 | |
| G | 1222 | |
| E | 447 | 10.5% |
| B | 362 | 8.5% |
| C | 355 | 8.3% |
| D | 242 | 5.7% |
| A | 98 | 2.3% |
| T | 6 | 0.1% |
| (Missing) | 100 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 1445 | |
| g | 1222 | |
| e | 447 | 10.7% |
| b | 362 | 8.7% |
| c | 355 | 8.5% |
| d | 242 | 5.8% |
| a | 98 | 2.3% |
| t | 6 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 1445 | |
| G | 1222 | |
| E | 447 | 10.7% |
| B | 362 | 8.7% |
| C | 355 | 8.5% |
| D | 242 | 5.8% |
| A | 98 | 2.3% |
| T | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 1445 | |
| G | 1222 | |
| E | 447 | 10.7% |
| B | 362 | 8.7% |
| C | 355 | 8.5% |
| D | 242 | 5.8% |
| A | 98 | 2.3% |
| T | 6 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 1445 | |
| G | 1222 | |
| E | 447 | 10.7% |
| B | 362 | 8.7% |
| C | 355 | 8.5% |
| D | 242 | 5.8% |
| A | 98 | 2.3% |
| T | 6 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 1445 | |
| G | 1222 | |
| E | 447 | 10.7% |
| B | 362 | 8.7% |
| C | 355 | 8.5% |
| D | 242 | 5.8% |
| A | 98 | 2.3% |
| T | 6 | 0.1% |
Cabin_num
Real number (ℝ)
MISSING 
| Distinct | 1505 |
|---|---|
| Distinct (%) | 36.0% |
| Missing | 100 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 610.17884 |
| Minimum | 0 |
|---|---|
| Maximum | 1890 |
| Zeros | 7 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 174 |
| median | 442 |
| Q3 | 1027 |
| 95-th percentile | 1576.2 |
| Maximum | 1890 |
| Range | 1890 |
| Interquartile range (IQR) | 853 |
Descriptive statistics
| Standard deviation | 514.96813 |
|---|---|
| Coefficient of variation (CV) | 0.84396262 |
| Kurtosis | -0.78044905 |
| Mean | 610.17884 |
| Median Absolute Deviation (MAD) | 341 |
| Skewness | 0.68395888 |
| Sum | 2548717 |
| Variance | 265192.18 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 21 | 0.5% |
| 31 | 18 | 0.4% |
| 294 | 16 | 0.4% |
| 197 | 16 | 0.4% |
| 34 | 14 | 0.3% |
| 228 | 14 | 0.3% |
| 41 | 13 | 0.3% |
| 231 | 13 | 0.3% |
| 160 | 13 | 0.3% |
| 184 | 13 | 0.3% |
| Other values (1495) | 4026 | |
| (Missing) | 100 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.2% |
| 1 | 5 | 0.1% |
| 2 | 5 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 21 | |
| 5 | 6 | 0.1% |
| 6 | 5 | 0.1% |
| 7 | 12 | |
| 8 | 5 | 0.1% |
| 9 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 1890 | 1 | |
| 1887 | 1 | |
| 1885 | 1 | |
| 1883 | 1 | |
| 1882 | 1 | |
| 1881 | 1 | |
| 1879 | 1 | |
| 1874 | 2 | |
| 1869 | 1 | |
| 1862 | 1 |
Cabin_side
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 100 |
| Missing (%) | 2.3% |
| Memory size | 33.5 KiB |
| S | |
|---|---|
| P |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4177 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | S |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 2093 | |
| P | 2084 | |
| (Missing) | 100 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 2093 | |
| p | 2084 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2093 | |
| P | 2084 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 2093 | |
| P | 2084 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 2093 | |
| P | 2084 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4177 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 2093 | |
| P | 2084 |
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | Cabin_deck | Cabin_num | Cabin_side | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0013_01 | Earth | True | G/3/S | TRAPPIST-1e | 27.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Nelly Carsoning | G | 3 | S |
| 1 | 0018_01 | Earth | False | F/4/S | TRAPPIST-1e | 19.0 | False | 0.0 | 9.0 | 0.0 | 2823.0 | 0.0 | Lerome Peckers | F | 4 | S |
| 2 | 0019_01 | Europa | True | C/0/S | 55 Cancri e | 31.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Sabih Unhearfus | C | 0 | S |
| 3 | 0021_01 | Europa | False | C/1/S | TRAPPIST-1e | 38.0 | False | 0.0 | 6652.0 | 0.0 | 181.0 | 585.0 | Meratz Caltilter | C | 1 | S |
| 4 | 0023_01 | Earth | False | F/5/S | TRAPPIST-1e | 20.0 | False | 10.0 | 0.0 | 635.0 | 0.0 | 0.0 | Brence Harperez | F | 5 | S |
| 5 | 0027_01 | Earth | False | F/7/P | TRAPPIST-1e | 31.0 | False | 0.0 | 1615.0 | 263.0 | 113.0 | 60.0 | Karlen Ricks | F | 7 | P |
| 6 | 0029_01 | Europa | True | B/2/P | 55 Cancri e | 21.0 | False | 0.0 | NaN | 0.0 | 0.0 | 0.0 | Aldah Ainserfle | B | 2 | P |
| 7 | 0032_01 | Europa | True | D/0/S | TRAPPIST-1e | 20.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Acrabi Pringry | D | 0 | S |
| 8 | 0032_02 | Europa | True | D/0/S | 55 Cancri e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Dhena Pringry | D | 0 | S |
| 9 | 0033_01 | Earth | False | F/7/S | 55 Cancri e | 24.0 | False | 0.0 | 639.0 | 0.0 | 0.0 | 0.0 | Eliana Delazarson | F | 7 | S |
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | Cabin_deck | Cabin_num | Cabin_side | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4267 | 9260_01 | Earth | True | G/1503/P | 55 Cancri e | 3.0 | NaN | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Luisy Portananney | G | 1503 | P |
| 4268 | 9262_01 | Earth | False | F/1795/S | 55 Cancri e | 20.0 | False | 0.0 | 601.0 | 103.0 | 35.0 | 0.0 | Sonald Hurchrisong | F | 1795 | S |
| 4269 | 9263_01 | Earth | True | G/1495/S | TRAPPIST-1e | 43.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Loisey Heney | G | 1495 | S |
| 4270 | 9265_01 | Mars | False | D/278/S | TRAPPIST-1e | 43.0 | False | 47.0 | 0.0 | 3851.0 | 0.0 | 0.0 | Toate Cure | D | 278 | S |
| 4271 | 9266_01 | Earth | False | F/1796/S | TRAPPIST-1e | 40.0 | False | 0.0 | 865.0 | 0.0 | 3.0 | 0.0 | Danna Peter | F | 1796 | S |
| 4272 | 9266_02 | Earth | True | G/1496/S | TRAPPIST-1e | 34.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Jeron Peter | G | 1496 | S |
| 4273 | 9269_01 | Earth | False | NaN | TRAPPIST-1e | 42.0 | False | 0.0 | 847.0 | 17.0 | 10.0 | 144.0 | Matty Scheron | NaN | NaN | NaN |
| 4274 | 9271_01 | Mars | True | D/296/P | 55 Cancri e | NaN | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Jayrin Pore | D | 296 | P |
| 4275 | 9273_01 | Europa | False | D/297/P | NaN | NaN | False | 0.0 | 2680.0 | 0.0 | 0.0 | 523.0 | Kitakan Conale | D | 297 | P |
| 4276 | 9277_01 | Earth | True | G/1498/S | PSO J318.5-22 | 43.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Lilace Leonzaley | G | 1498 | S |